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Abstract 


In steel-making continuous casting is one of the widely used process used for casting 
billets, blooms and other semi-finished products. The mold region of a continuous caster 
plays a crucial role in determining the overall efficiency of the process. The solidified shell 
that develops in the mold region directly influences all the other operations downstream and 
becomes a major factor for the attainment of required structures and mechanical properties 
in the semi-finished products. 

For proper understanding of the primary solidification process that occurs in the mold 
region, a heat transfer based optimization analysis of the mold has been carried out in 
this study to evaluate the maximum casting velocities under various conditions. To achieve 
the optimum values of the operational variables in the mold design problem, an emerging 
methodology in evolutionary computation, called genetic algorithms, coupled with other tra- 
ditional and non-traditional optimization techniques has been adopted. In order to consider 
the physical phenomena of solidification process taking place in the mold, several objective 
functions were constructed, expressing casting velocity as a function of a total of eighteen 
process variables. A steady state heat balance along with the pertinent equations for mold 
oscillation and slag vitrification was used for this purpose. A number of constraints arising 
out of physical considerations were also considered. 

The reliability of the model proposed in this study was verified by comparing the results 
obtained by this rigorous optimization scheme with a significant amount of industrial data 
available in the literature. A functional relationship of the optimum casting speed with 
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design variables was found by optimizing the parameters within predefined ranges. The 
efficiency, search power and complexity of different optimization techniques such as genetic 
algorithms, simulated annealing and differential evolution had also been compared in this 
work. 
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Chapter 1 


Introduction 


Continuous casting process is one of the major technological developments in the 20th 
century which is gradually replacing the ingot casting operation for the production of the 
semi-finished blooms, billets and slabs. Only a brief introduction is necessary to comprehend 
the continuous casting revolution that is shaping up the steel-making process, as a tremen- 
dous amount of research effort has gone into this particular area in recent years and many 
detailed reports are available in the literature[l, 2, 3]. The continuous casting involves a 
scciuence of steps which is shown schematically in Figure 1.1. Liquid steel is first transferred 
from a steel-making ladle to a tundish. The tundish holds the steel during casting process 
and enables control of steel flow into a mold [4]. 

The mold region of a continuous caster is known to play a very crucial role in the 
overall efficiency of the process [3]. In this region primary cooling of liquid steel takes place. 
Therefore, formation of a solidified shell on the slab or billet surface is initiated in this region. 
The process of solidification continues as the billet or slab passes downward in the caster 
guided by the support rolls. The slab or billet is then additionally cooled by a series of 
water sprays located in the secondary cooling region which includes support rolls enclosed in 
a roller apron. The whole solidification process comes to an end after passing through this 
region. Finally, the product comes down through the withdrawal rolls. Later, the billet or 
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slab passes through the bending and straightening rolls and is cut into pieces of predefined 
length. 

Semi-finished steel, produced by continuous casting process, is generally of acceptable 
quality in terms of its cleanliness, cracks, segregation and shape. Furthermore, the inherent 
advantages of low-cost, high yield, and fiexible operation along with the ability to achieve a 
high quality cast product [5] has been increasingly prompting the steel industries worldwide 
to adopt the continuous casting technology for the last two decades. The continuous cast 
steel can directly be rolled or hot charged to a re-heating furnace without cooling for an 
intermediate inspection which adds to the efficiency of this process. Casting of good quality 
products through this route however requires a proper understanding of the key operations of 
the ciistcr that influence the operator’s ability to control and monitor the process effectively. 
The key factors those need to be studied are fluid flow, heat flow, mechanical properties of 
the steel at the elevated temperature, stress- generation and solidifleation [6]. 

The fluid flow in the tundish and mold has a strong influence on the cleanliness of the 
steel as it is responsible for the maximum inclusion float-outs. In addition production of 
finished products with low inclusions content requires control of the following aspects [7]: 

(i) delivery of clean steel from tundish to caster. 

(ii) minimization of oxygen transfer from air and the refractories. 

(iii) prevention of the exogenous inclusion pickup from the refractories, ladle, tundish or 
mold powder. 

(iv) usage of optimum mold powder. 

The extraction of heat in mold and spray cooling zone is the major operation taking 
place in a continuous caster. In order to predict the temperature field in the solidifying 
steel and in the mold wall several mathematical models have been proposed so far. The 
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models arc able to simulate the casting operations depending upon the actual measurement 
of surface heat flux in the different cooling regions [8]. 

In the mold, heat is transferred from the liquid steel to the cooling water via an air gap 
which separates the mold and the strand, the mold wall, and the interface between the outer 
mold wall and the cooling water. Among these, the air-gap provides the largest resistance 
to heat flow. In fact, the amount of heat transferred ■ is almost inversely proportional to 
the air gap width [5, 6]. However the air gap width depends in a complex manner on the 
operational variables, and is almost impossible to predict accurately using a mathematical 
expression. 'The air gaj) width also varies both in longitudinal and transverse directions, 
resulting in a non-uniform heat extraction pattern. Heat transfer alfccts various factors of 
the steel making process. Among the hictors which are influenced by heat extraction from 
the mold the following arc of importance [9]. 

(i) For a i)articular steel composition the extent of thermal shrinkage and the phase trans- 
formation shrinkage is governed by the temperature distribution of the solidified shell. 
This in turn is affected by the mold heat transfer. 

(ii) The ability of the shrinking solidified shell to resist the ferrostatic pressure depends on 
the evolution of the gai). The gap evolution is affected by the mold wall temperature, 
the solidified shell temperature and its thickness, and the mold heat extraction [5, 10]. 

(iii) The mold shape, especially the taper of the billet mold, is one of the important factors 
influenced by the heat transfer i)rocess. The narrow face of the slab mold and the walls 
of a billet mold are tapered to compensate for shrinkage. The billet molds may distort 
because of the differential thermal expansion coupled with the lack of constraints. The 
resultant change in taper may cause oscillation marks to form on the billet surface 
which in turn temd to cudargr^ the strand-mold gap width. Slab molds arc less prone to 
this problem [10]. 
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All these factors make computation of mold heat transfer from the first principle very 
difficult. Among them, control of the oscillation marks is one of the prime requirements for 
quality casting. In addition to the heat transfer process oscillation marks also depend on 
oscillation stroke and frequency, lubrication properties of oil or mold powder, magnitude of 
taper at the meniscus level, and all other variables affecting the mold distortion. In billet 
casting with oil lubrication, oscillation marks occur by mechanical interaction of mold with 
the billet during the downward stroke of oscillation cycle. In slab casting with mold powder 
this occurs because of the pressure generated in the mold flux channel between the mold and 
strand during the down stroke of oscillation [5, 6]. 

It follows from the above discussion that in continuous casting of steel, control of both 
fluid flow and heat transfer are very crucial. This is also very important in order to achieve 
the product quality and associated productivity [8]. An analysis of the solidification phe- 
nomena under continuous casting conditions also helps in understanding the segregation 
problems and its remedies. The solidification pattern that develops in the mold region di- 
rectly influences all the operations downstream, and becomes a key factor in the attainment 
of the required texture in the finished products [11]. The studies relating to continuous 
casting mold are rather limited to the traditional heat-transfer mechanisms. The rigorous 
research w'orks of Brimacombe et al. [5] have achieved what perhaps could easily be achieved 
through the numerical solutions of the heat-transfer equations obtained by a very sophisti- 
cated experimentation scheme [3]. In order to achieve the better understanding, one at this 
stage needs an approach drastically different from what has been done so far. In this work, 
we are going to elaborate one such methodology for tackling the continuous casting mold 
design problem. In order to do this we have adopted an optimization scheme of the decision 
variables using Genetic Algorithms (GA), an emerging methodology in evolutionary com- 
putation [3] which has been tested against a number of other optimization techniques. Till 
now, only a few applications of GA are reported in the field of process metallurgy. However, 



it is likely to influence the future research in this area quite significantly. 
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Figure 1.1: Basic operations of a continuous casting machine 





Chapter 2 


Fundamentals of Optimization 
Methods 


2.1 Traditional Methods 

An engineering design or decision making problem has an objective of minimizing or 
maximizing a function and simultaneously have a requirement for satisfying some constraints 
arising due to space, strength, or stability considerations. A constrained optimization prob- 
lem comprises of an objective function together with a number of equality or inequality 
constraints. Often lower bound or upper bounds on design or decision variables are also 
specified. Considering N design or decision variables a single-objective constrained opti- 
mization problem is formulated in the following fashion [15]. 

Minimize or Maximize fi^) > where X = [xi] 

Subject to 

^;(AO>0 i = l,2,..J 

hi{X) = Q Z = 1,2...L 

where, z = 1, 2, ...N 


> 


( 2 . 1 ) 
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\\ here, the function f{X) is the objective function which is to be optimized. The functions 
pj(A) and hi{\) are inequality and equality constraints, respectively. In the above formu- 
lation, J such inequality constraints and L such equality constraints have to be satisfied. 

Any point satisfies a constraints, if the left side expression of the constraint 
at that point agrees with the right-side value by the relational operator between them. A 
point(or solution) is defined as a feasible point (or solution) if all the equality and inequality 
constraints and the variable bounds are satisfied at that point. All the other points are known 
as infeasible points. 

2.1.1 Fundamentals of The Penalty Function Method 

Penalty function methods transform the basic optimization problem in such a way that 
numerical solutions arc obtained by solving a sequence of unconstrained optimization prob- 
lems [12]. This alternative unconstrained formulation is done by adding penalty terms for 
each constraint violation. If a constraint is violated at any point, the objective function is 
penalized by an amount depending upon the extent of constraint violation. The penalty 
terms vary in the way the penalty is assigned [12]. 

Some penalty methods cannot deal with infeasible points at all and even penalize feasible 
points which are close to the constraint boundary. These methods are known as Interior 
penalty methods. In these methods, every sequence of the unconstrained optimization finds 
a feasible solution. The other kind of the penalty methods penalize infeasible points but 
do not penalize feasible points. These methods are known as Exterior penalty methods. 
In these methods, every sequence of the unconstrained optimization finds an improved yet 
infeasible solution [15]. If the optimum solution is an interior point in the feasible region, 
one sequence of the exterior penalty method should find the optimum point. On the other 

hand, if the optimum point is on a constraint boundary, several sequences may be necessary. 

The optimization problem described above, is converted into an unconstrained opti- 
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mization problem by constructing a function of the form [12] 

k = (4(A-, r,) = /(X) + n y G,fc(A')l (2,2) 

3=1 

where Gj is some function of the constraint and is a positive constant known as the 
penalty parameter. The second term on the right hand side of the Equation (2.2) is known 
as the penalty term. If the unconstrained optimization of the ^-function is repeated for a 
sequence of values of the penalty parameter r^, (^' = 1, 2, ....N), the solution may be brought 
to convergence for the original problem stated by Equation (2.1) [12]. Thus, the penalty 
function methods are also known as sequential unconstrained optimization techniques. 

In the interior formulation, some of the popularly used form of are given by [12] 


Inverse Penalty: G, = { — 

Log Penalty: Gj = —ln{gj{X)} 

In ca.se of the exterior penalty function formulation the commonly used forms of the 
function Gj are [15] 


Infinite Barrier Penalty: Gj = 


Ela(A')| 


b-€/ j 

where J denotes the set of violated constraints at the current point. 


Parabolic Penalty: Gj = max{0, g^{X)y 

The convergence of the unconstrained minimization of is illustrated for the simple prob- 
lem, given below [12]. 

Find A'' = Xi which minimizes f{x) — qxi 
subject to 


giiX) = Xi-P>0 
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Exterior Method Interior Method 

Figure 2.1: The gradual convergence of the Penalty Function Method 


It can be seen from the Figure 2.1 that the unconstrained minima of the function 
(^{X, Tk) converge to the optimum point X* as the parameter is increased sequentially 
for the exterior method. On the other hand, the interior method shown in the figure gives 
convergence as the parameter is decreased sequentially [12]. 

The sequential nature of penalty function method allows a gradual approach towards 
the criticality of the constraints [12]. In addition, the sequential process permits a graded 
approximation, which can used in the analysis of the system. This means that if the evalu- 
ation of / and gj{X) [ hence for any specified design vector X is computationally 

very difficult, we can use coarse approximation during the early stages of optimization, and 
finer or more detailed analysis and approximation during the final stages of optimization. 
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In the exterior penalty function method, the ^-function is generally taken as 

= f{X) + nj2i9AX))‘' ( 2 . 3 ) 

j=l 

where, is the positive penalty parameter, the exponent q is nonnegative constant. The 
bracket function (^j(A')) is defined as 


(5jW) = max{gj{X),0) = { 


0 , 


ifgjiX) > 0 
(constraint is satisfied) 
if9j{X) < 0 
(constraint is violated) 


It can be seen from Equation(2.3),that the effect of the second term on the right hand 
side is to increase o{X,rk) in proportion to the q-th power of the amount by which the 
constraint is violated [12]. Usually,the function <j){X,rk) possesses a minimum as a function 
of A’ in the infeasible region. The unconstrained minima A'^J converge to the optimal solution 
of the original problem, as k oo and oo Thus the unconstrained minima approach 
the feasible domain gradually, and as k oo , (A'^) eventually lies in the feasible region. 

For q > I, the o-function will have continuous first derivatives, which is given by 


def) 

Oxi 


UXi 


dgAX) 

dxi 


Generally, the value of q is chosen as 2 in practical computation. 


The ALGORITHM for the Exterior Penalty Function Method is described in the follow- 
ing steps. 

(i) Start from any design vector A^'i and suitable value of ri, set K=l, 

(ii) Find the vector that minimizes the function 



2.1 Traditional Methods 


12 


(iii) Test whether the point A\! satisfies all the constraints. If A\! is feasible, it is the desired 
optimum. Hence Terminate, otherwise go to step(iv). 

(iv) Choose the ne.\t value for the penalty parameter which satisfies the relation r^+i > 
i.e. = c > 1. Set, K=K+1 , go to step(ii). 



Figure 2.2; The flow chart for the Exterior Penalty Function Method 

A flow-chart for this method is shown in Figure 2.2. To solve an optimization problem, 
involving both equality and inequality constraints the following form of ^function has been 












2.1 Traditional Methods 


13 


formulated. 

Ok = Tk) = f{X) + Tfc (5 j(A0)^ + E (2-4) 

j=: 1=1 

The unconstrained (;6-function can be minimized using Steepest Descent Method, as 
described below. 

2.1.2 Steepest Descent Method 

The search direction for minimization is the negative of the gradient vector of the 
function at any point X 

Si = -V/,(A) (2.5) 

Starting from an initial trial point A'l the algorithm searches towards the optimum point 
according to the rule 

AVi = A', + = A. - a: V/.(A) (2.6) 

where A* is the optimal step length along the search direction Si or — V/,(A'’). 

The ALGORITHM for the Steepest Descent Method is described in the following steps. 

(i) Choose a maximum number of iterations M to be performed, an initial point Aq, two 
termination criteria €i and £ 2 - Set i = 0. 

(ii) Calculate V/(A'j), the first derivative of the function at point (A"!) 

(iii) If ||V/(A'',)|| < Terminate. Else if i > M , Terminate. Else go to step (iv). 

(iv) Perform Unidirectional Search to find X* using £2 such that, 

/(Ai+i) = f{Xi - XlVfi) 


is minimum. Another Criteria for termination is when , 

|V/(Ai+x).V/(Ai)| < £2 
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(v) Check, whether 


iwll - 

If yes, Terminate. Else set z = (i + 1), and go to step (ii) 

A flow chart for the method is shown in Figure 2.3. The Steepest Descent uses the bounding 



Figure 2.3: The flow chart for the Steepest Descent Method 


phase and the golden section methods in order to perform the unidirectional search to find 
A* which minimizes f{Xi + XiSi), which are briefly described below. 
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2.1.3 Bounding Phase Method 

The ALGORITHM for the Bounding Phase Method involves the following steps. 

(i) Choose an initial guess Xq and an increment A. Set k = 0 

(ii) If f{xQ - |A|) > f(xo) > f{xQ + |Al),then A is positive; Else if /(xq - |A|) < /(xq) < 
/(xo + |A|),then A is negative; Else go to step (i). 

(iii) Set Xfc+i = xjt + 2^'A . 

(iv) If /(xfc+i) < /(xfc), set k = {k + 1) and go to step(iii). Else the minimum lies in the 
interval (xjt_i, xjt+i) and Terminate 

2.1.4 Golden Section Method 

The ALGORITHM for the Golden Section Method is described in the following steps. 

(i) Choose a lower bound a and an upper bound b . Also choose a small number e . 

Normalize the variable x by using the equation ir = . Thus Ou, = 0,6^, = 1, and 

lu, = 1. Set k = 1. 

(ii) Set Wi = Ou.. 4-(0.6lS)ilu.- and W 2 = b^ — {0.Q1S)L^, Compute /(r'i) or /(u.' 2 ), depending 
upon w'hichever of the two was not evaluated earlier. Use the fundamental Fundamental 
Region Elimination Rule [15] to eliminate a region: Set new a^ and b^. 

(iii) Is |Lu,| < £(a small number for termination )? If no . set A; = {k + 1) , go to step (ii), 
Else Terminate 

2.2 Fundamentals of the Genetic Algorithms 

Genetic Algorithms (GA) differ from the traditional search and optimization procedures 
in a numerous way. Basically, most of the traditional procedures are gradient-based, i.e. 
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starting from an initial guess vector, the search algorithm proceeds iteratively toward the 
optimal point taking the gradient information into account in the A?'-dimensional search 
space. In the previous section, one of the traditional method and its procedure has been 
described in details, showing how it leads to nearby optimum. In genetic algorithms, the 
approach is entirely different, as it work with the coding of the design variables, called 
individuals, a matter of which is very uncommon in traditional methods. This allows genetic 
algorithms to be applied to discrete or discontinuous function optimization problems. Since 
no gradient information is used in GAs, they can also be applied to non-differentiable function 
optimizations. This makes genetic algorithms much more robust in the sense that they can 
be applied to a wide variety of problems. 

Unlike, many traditional optimization methods, genetic algorithms work with a popula- 
tion of points, which more or less ensures the possibilities of obtaining the global optimal so- 
lution for any given problem. Genetic Algorithms use the probabilistic transition rule instead 
of fixed rules. The group of individtials, called population in genetic algorithms terminol- 
ogy, are guided to the global optimum, by powerful genetic operators, such as Reproduction, 
Crossover and Mutation. The randomness in the genetic algorithms operators has an effect 
of producing impartial search direction early on, thereby avoiding a hasty wrong decision 

values; GAs have a directed search later in the simulation. 

Genetic algorithms are search procedures which are based on the mechanism of natu- 
ral genetics and natural selection. The basic principle of genetic algorithm is based upon 
the quasi-Darwinian principle of survival of the fittest, combined with simulated genetic op- 
erators abstracted from nature, where an individual attempts to pass its clone to the next 
generation and it is evaluated in terms of its fitness which is either the function itself or some 
appropriate transformation of it [17]. For example, in a maximization problem of a function 
(f){X) in between the upper and lower bounds of the decision variable 
the fitness function f(X) is defined same as (l>(X.), but on the contrary, for a minimization 
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problem the fitness function may be defined as, 




1 

1 + 4^{x) 


(2.7) 


Several other suitable transformation methods are also available depending upon the nature 
and complexity of the problem. These transformation methods do not alter the location of 
the global optimum, but convert the minimization problem into a maximization problem. 

For any search and learning methods the way in which candidate solutions are encoded 
is a central factor in the success of a genetic algorithms [17]. In most genetic algorithms 
applications, fixed-length, fixed-order bit strings are used to encode candidate solutions [17]. 
The decision Vciriables are usually mapped into the desired interval in the corresponding 
solution space by a string (chromosoine) of binary alphabets T’s and ‘O’sfpene/ The length 
of the string depends upon the precision of the desired solution. In a multi-parameter 
optimization problem, individual parameter codings (substrings) are concatenated together 
to form a bigger complete string. Eventhough binary codings are mostly applied, but it is 
not absolutely nece.ssary. There exist some studies, whore the decision variables are encoded 
in real numbers. 

For example, if the five bit string are used to code the variable x,, then the strings 
(00000) and (11111) would represent x[^^ and since those substrings have the minimum 
and maximum decoded value. For converting the binary substrings to real space, linear 
mapping rule is common among the several transformation schemes [15]. 


Xi 



-b 


.iu) _ JL) 

i '^i 

2 '* - 1 




( 2 . 8 ) 


where the variable Xi is coded in a substring Si of length The decoded value of a binary 
substring Si is calculated as 

5i = 22'(si) (2.9) 

1=0 

where Sj G (0, 1). For five bits to code each variable, there are only 2^ or 32 distinct 
substrings possible, because each bit position can take either 0 or 1. 
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First, a population of strings representing the decision or design parameters is created at 
random. The size of the population depends upon the string length and the problem being 
optimized. Each string is then evaluated. The evaluation procedure first requires decoding 
of the decision variables from the chromosomes. Once the values of the decision variables are 
obtained, they are used to calculate the fitness function value. After that the Reproduction 
Operator takes over and mating pool is created. Further details are provided below. 


2.2.1 Reproduction Operator 


Reproduction is the first operator applied on a population. Reproduction selects good 
strings in a population and forms a mating pool [15]. The commonly used reproduction 
operator is the Proportionate Reproduction, where a string is selected for the mating pool 
with a probability proportional to its fitness value. Thus, the z-th string in a population 
is selected with a probability proportional to to (j){X). Since the population size is usually 
kept constant in a Simple Genetic Algorithm (SGA), thus the sum of the probability of each 
string being selected for the mating pool must be one. Thus, the probability for selecting 
the i-th string is [15] 


P.= 


4 >^ 


( 2 . 10 ) 


where, n is the population size. 


The way to implement this selection operator, is known as “Roulette Wheel Selection” 
in GA literature [19]. 

In “Roulette Wheel Selection” scheme, a roulette wheel is imagined with a portion of its 
circumference marked for each string which is proportional to the string’s fitness [15]. The 
roulette wheel is spun n times, and each time selecting an instance of the string chosen by 
the roulette wheel pointer. Since, the circumference of the wheel is marked according to the 
string’s fitness, thus it can be expected to make ^ copies of the z-th string in the mating 
pool, when the average fitness of the population is calculated as [15] 


fiavg — 


r.U't'i 


n 


( 2 . 11 ) 
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In Stochastic Remainder Selection” the expected count for each individual is calcu- 
lated. Then the strings are first assigned the number of copies exactly equal to the mantissa 
of the expected count. Thereafter, the regular “Roulette Wheel selection” is implemented 
using the decimal part of the expected count as the probability of selection. Furthermore 
“Stochastic Remainder Selection” selection method is less noisy as compared to “Roulette 
Wheel Selection” [15]. 

In Simple Genetic Algorithm (SGA), the population size is usually kept constant. Thus, 
the probability for selecting an individual from z-th class ^ as in the t-th. generation is given 
by 


Pi,t 






( 2 . 12 ) 


where ni denotes the number of individuals, in which k classes exist. According to Goldberg 
and Deb [20], for a non-overlapping population of constant size n, the expected number of 
copies of the f-th class in the next generation is 


nii^t+i - mi^t-n.pi^t 


h 

It 


(2.13) 


where ft, the average function value of the current generation is given as 



n 


This above eciuation may be written in proportional form dividing by the population size n 
as 

a,.+i = a,. I (2.14) 

By solving the Eejuation 2.12 explicitly, it can be shown that 


P^,t = 


/,‘a,o 




(2.15) 


In order to calculate the performance measure and time complexity of the reproduction 
operators, Goldberg and Deb [20] have defined take-over time as the time during which 

^Here, the class is essentially defined as individual’s relationship with a particular form of schema [19] in 

the population. 
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the population contains (n — 1) best individuals out of n population size starting from the 
beginning. Considering, the monomial test case, the takeover time can be calculated as 

t* = i(nlnn-l) (2.16) 


Thus the takeover time for a polynomially distributed objective function is 0(n logn). For 
comparison, the takeover time for an exponentially distributed (or exponentially scaled) 
function is given by [20] 


1 


nlnn 


(2.17) 


Thus, under the unit interval consideration, both a polynomially distributed function and 
the exponentially distributed function have the same rate of convergence. 

The major problem with fitness proportionate selection is that early in the search the 


fitnc.ss variance in the population is high and a small number of individuals are more fit than 


the others. Under fitness proportionate selection they and their descendent will multiply 
(puckly in the i)opulation, in effect preventing genetic algorithms from doing any further 
exploration, causing premature convergence [17]. In this situation Fitness proportionate se- 
lection results in a lack of diversity in the population. Avoidance of such problem involves 


the usage of low selection pressure (the degree to which highly fit individuals have many 


offsprings [17]) at the initial period when the population variance is high and increase the 


pressure at a later time period when the variance decreases. 

Rank Selection is the another selection operator, where sorting of the population is 
done from best to worst, the number of copies that each individual should receive is assigned 
according to a non-increasing assignment function and then a proportionate selection is 
performed according to that assignment. Ranking avoids giving the far largest share of 
offspring to a small group of highly fit individuals and reduces the selection pressure when 
the fitness-variance is high and keeps up the selection pressure when the fitness variance 
is low [17]. According to Baker [17], in the linear ranking method each individual in the 
population is ranked in increasing order of fitness from 1 to N. The expected value of each 
individual i in the population at time t is given by 


ExpVafii, t) = Min {Max — Min) 


rank{i, t) — 1 


iV-1 


(2.18) 
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where M in is the expected value of the individual with rank 1 and Max (this is assigned by 
the users) is the expected value of the individual with rank N. Goldberg and Deb [20] had 
shown that the takeover time for such a selection methodologj’ is 

2 

t* = -log{n-\) (2.19) 


This kind of selection technique had been identified to be potentially time consuming, 
especially when the population size is large. 

In order to make the selection methodology' in genetic algorithms less susceptible to 
premature convergence, Sigma Scaling technique has been adopted [17], which keeps the 
selection pressure relatively constant over the course of run rather than depending on the 
fitnes.s variances in the population. Under Sigma Scaling an individual’s expected value is 
a function of its fitness, the population mean, and the population standard deviation. 


Exp\''al{i, t) 


1-0 + ^^ ifcr{t)/0 
1.0 if cr(t) = 0 


( 2 . 20 ) 


whcue ExpVal(i,t) is the expected value of the individual i at time t, f{i) is the fitness of 
i, f{t) is th(’ mean fitness of the population at time t, and a{t) is the standard deviation of 
the population fitne.sses at time t. 

Tournament Selection Operator is similar to rank selection in terms of selection pressure, 
but it is computationally more efficient and more amenable to parallel implementation. In 
this selection mechanism a number of individuals (generally two) are selected at random from 
the population which are subjected to compete with each other. An uniform random number 
r is generated between 0 and 1, if r < ^: ( where k is a, parameter of preassigned value between 
0 and 1 ), then the better of the two individuals is selected for the mating pool. Finally, both 
are returned to the original population and can be selected again. According to Goldberg 
and Deb [15, 19], the idea of tournament selection basically comprises of selecting some 
number of individuals randomly from a population (with or without replacement) , selecting 
the best one from this group for further genetic processing, and repeating the sequences as 
often as desired (usually until the mating pool is filled). By solving for the takeover time 
yields an asymptotic expression which improves with increasing population size n ( Goldberg 
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and Deb) [20]. 


t* — r^[lnn + In(lnn)] 
Ins 


( 2 . 21 ) 


The idea of Elitism was first introduced by DeJong [24], which forces genetic algorithms 
to retain some number of the best individuals at each generation. Such individuals can be 
lost if they are not selected to reproduce or if they are destroyed by crossover or mutation. 

Stochastic Remainder “Roulette Wheel” selection, Stochastic Universal Sampling , Bi- 
nary Tournament selection are the more popularly used selection procedures considering 
time complexity as well as selection performance. In our study we have incorporated the 
binary tournament selection along with elitism, and also stochastic based selection opera- 
tors. Different selection schemes assign different number of copies to better strings in the 
population, but in all selection schemes the essential idea is that more copies are allocated 
to the string with higher fitness values. After reproduction the population is enriched with 
clones of good strings which are subjected to crossover and mutation. 


2.2.2 Crossover 

Genetic Algorithms employ some form of crossover- an operator that combines por- 
tions of the parental chromosomes to produce offsprings [17]. In crossover operator, new 
and expectedly better strings are created by exchanging information among the parental 
chromosomes selected by the reproduction operator for the mating pool. In a Single Point 
Crossover operator, generally two strings are picked from the mating pool at random and 
a crossing site (]) along the string is chosen randomly. Now a uniform random number is 
generated between 0 and 1, and if the random number r < Pc (probability of crossover) , then 
the operator exchanges the subsequences of the two string on the right side of the crossing 
site [17]. For example, for two five bit strings 


Before crossover : 



2.2 Fundamentals of the Genetic Algorithms 


23 


0 0 I 0 0 0 

1 1 I 1 1 1 

after crossover : 

0 0 111 

110 0 0 

where the crossing site is chosen randomly after the second locus of the strings. Basically, 
crosso\er mimics biological recombination between two single chTOxnosome{haploid} organ- 
isms [19]. 

In Older not to destioy previously found good strings of the mating pool, crossover is 
usually performed with a probability p<; < 1 (a pre-assigned value), because depending upon 
the crossing site chosen randomly, the crossover may have some detrimental effect along 
with its beneficial effect. The idea of crossover is to recombine building blocks (schemas) 
of different parental chromosome in the offsprings. The single point crossover operator has 
some bias of exchange for the right-most bits. They have a higher probability of getting 
exchanged than the left-most bits in the string. Single point crossover keeps intact short, 
low-ord('r schemas as the functional building blocks of the strings; so schemas with long 
defining lengths are likely to be destroyed under this operator [19]. This shortcomings can 
lead to the pre.scrvation of the hitchhikers It has also been observed in “ ROYAL ROAD” 
experiimmts by Mitchell et al. [17]. Single point crossover often treats some loci (crossing 
site) preferentially, as a result the segments exchanged between the two parents always 
contain the end-point of the strings. 

To reduce the positional bias, and end point effect, two point crossover is sometimes 
preferred, in which two positions are chosen at random and the segments between them 
are exchanged. The two point crossover is less likely to disrupt schemas with large defining 
lengths and can combine more schemas than single point crossover. In addition, the segments 
that are exchanged do not necessarily contain the end points of the strings. Again, there are 

^bits that are not part of a desired schema but which, by being close to the string, hitchhike along with 
the beneficial schemas it produces 
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schemas that two point crossover can not combine. In order to overcome those difficulties, 
multi point crossover is often used. Example of two point crossover is given below 

Before crossover : 

0 I 0 0 I 0 0 

1 I 1 i I 1 1 
after cross-over : 

0 110 0 

10 0 11 

Extension of the above crossover operators is the parameterized uniform crossover [17] 
where an exchange happens at any bit location from either parent with a probability Pc = 0.5. 
Parameterized uniform crossover has no positional bias , any schemas contained at different 
positions in the parents can be potentially recombined in the offsprings [19]. An example of 
the uniform crossover operator is given below, whore the first and the fourth bit positions 
have been exchanged. 

Before cro.ssovcr : 

0 0 0 0 0 

11111 
after cross-over : 

10 0 10 

0 110 1 

this operator has the maximum search power among all of the above crossover operators, but 
simultaneously has the minimum survival probability for good bit combinations (schemas) 
from parents to offsprings. In our present study single point crossover operator is used along 
with the option for uniform crossover. 

2.2.3 Mutation 

The crossover operator is generally regarded as the major instrument of variation and 
innovation in the population of the genetic algorithms, while the mutation operator prevents 
the population from permanent fixation at any particular locus and thus plays more of a 



2.2 Fundamentals of the Genetic Algorithms 


25 


background role [17]. Mutation operator is used sparingly and changes 1 to 0 and vice- 
versa, with a small pre-assigned probability The bitwise mutation or jump mutation is 
performed bit by bit by simulating of flipping a coin with probability pm- The simulation 
of flipping a coin is done by generating a uniform random number between 0 and 1, and if 
the random number is smaller than p,ri, then the outcome of the flipping is true, otherwise 
the outcome is false. For any bit if the outcome is true, then the bit is altered, otherwise 
kept unchanged. The need of mutation is to create a point in the neighborhood of the 
current point, thereby achieving a local search around the current solution. It also introduces 
diversity in the population whenever the population tends to become homogeneous due to 
repitative usi' of the selection and crossover operators. The inclusion of mutation introduces 
some probability {Xpm) of turning 0 to 1, where N is the population size. 

Another variant of mutation, called creep mutation is also used. In creep mutation each 
variable is examined in the real space, and is either increased or decreased by a small value 
with a probability of Pevet-p 'Wid then reconverted into the binary space. 

After new strings are created, they are evaluated by decoding and calculating the ob- 
jective function values. This completes one cycle of genetic algorithms iteration, known as 
generation in genetic algorithms terminology', the same process continues until a termination 
criterion is satisfied. In next few sections some advanced operators used in genetic algorithms 
and some modern developments in this methodology are discussed. 

2.2.4 Micro GA 

A Simple Genetic Algorithms (SGA) works with a serially implemented binary coded 
population of size N, with a generation to generation evolution based on reproduction, 
crossover and mutation. The performance of SGA can be measured in terms of on-line and 
off-line performance measures (the on-line performance measure is the average performance 
measure of of all the tested structures over the course of the search; the off-line performance 
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is the average of all the generation-based best performance structures). For the function 
optimization problems, where the function to be optimized is well defined and do not change 
faster than the evolution process itself, SGA can be implemented very easily. SGA therefore 
can handle stationanj function optimization problems quite satisfactorily to find the exact 
optimum. However, for the non- stationary function optimization problems, where the func- 
tions to be optimized are themselves evolving at a much faster rate, SGA can not find an 
optimum. Problems of this type are found in many real world situations, such as in aerospace 
(pursuit and evasion), on-line air-craft trajectory optimization, and the optimal control of a 
aircraft in wind shear etc. Goldberg has proposed that for serial implementation of binary 
G.AlS the optimal population choice is small, which was obtained from optimizing effective 
n’al-time serhema processing in a given population. He also pointed out that simply by taking 
a small population size and letting them converge is not very useful and outlined a scheme by 
which a small population GA can be implemented. Based on this approach, a step by step 
{)rocedure for the [l—GA implementation, as {jroposed by Krishna Kumar, is given below [16]. 

1 . Seh'ct a population of size 5 either randomly or 4 randomly and one good string from 
any previous search. 

2. Evaluate fitness and determine the best string. Label it as string 5 and carry it to the next 
generation (elitist selection strategy). In this way there is guarantee that the information 
about good schema is not lost. 

3. Choose the remaining four strings for reproduction (the best string also competes for a 
copy in the reproduction) based on a deterministic tournament selection strategy [19]. Since 
the population is small, the law of averages does not hold good and the selection procedure 
is purely deterministic. In the tournament selection, the strings are grouped randomly and 
adjacent pairs are made to compete for the final four (Care should be taken to avoid two 
copies of the same string mating for the next generation). 
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4. Apply crosso\er with Pc = 1. This is done to facilitate a high order of schema processing. 
The mutation rate is kept to zero as it is clear that enough diversity is introduced after every 
convergence through new population of strings. 

5. Check for nominal convergence (reasonable measure based on either genotype convergence 
or phenotype convergence). If converged go to step 1, else go to step 6. 

6. Go to step 2. 

2.2.5 Niching and Sharing 

In artificial genetic search, multi-modal functions are optimized by introducing the nat- 
ural concepts of niche and species into a population of strings. To achieve better performance 
on non-stationary functions, dominance and diploidy hseve been added [19]. To overcome this 
limitations in fixed codings, inversion md reordering operator have been suggested by Gold- 
berg. In dealing with multi-modal functions, simple genetic algorithms converge to a single 
peak [19], even when multiple peaks of equality may exist. Faced with a similar problem, na- 
ture forms stable subpopulations of organisms surrounding separate niches by forcing similar 
individuals to share available resources [17]. In nature, niche is defined as an organism’s task 
in the environment and a species is a collection of organisms with similar features. The sub- 
division of environment on the basis of an organism’s role reduces inter-species competition 
for environmental resources, and this reduction in competition helps stable subpopulations 
to form around different niches in the environment [21]. 

In the optimization of multi-modal functions as shown in Figure 2.4, a simple genetic 
algorithms cannot maintain controlled competition among the competing schemata corre- 
sponding to different peaks, and the stochastic error associated with the genetic operators, 
called genetic drift , for finite population, causes the population to converge to one alternative 
or other [21]. 
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In DeJong s Crowding scheme, separate niches are created by replacing existing strings 
according to their similarity with others in an overlapping population. Generation Gap (G) 
dictates the use of an overlapping model in which only a proportion G of the population is 
permitted to reproduce in each generation [21], 



(b) unequal peaks 


Figun' 2.4: Sample functions where stable, relatively no competitive subpopulation might 
be u.seful (Source; [19. 21]); (a) function with equal peaks for single variable (expected, sub- 
Iiopulation to be equal in size);(b) function with unequal peaks for single variable (expected, 
subpopulatioii sizes to decrease with decreasing peak size); 


When selecting an individual to die, crowding factor (CF) determines the number of 
individuals that are to be picked at random from the population, and the one which is very 
much similar to the new individual is chosen for replacement. Here similarity criteria is 
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defined in terms of the number of matching alleles ^ The new individual then replaces this 
chosen individual in the population. 

Depending upon the Holland’s sharing concept, Goldberg and Richardson incorporated 
the sharing function by dividing the population in different subpopulations according to 
the similarity of the individuals in two possible solution space : the decoded parameter 
space {Phenotypic Sharing) and the binary or gene space {Genotypic Sharing). A sharing 
parameter asharc is defined to control the extent of sharing, and a power-law sharing function, 
Sh(d) is defined as a function of distance-metric (d) between two individuals 

Sh(d} ={'■“' ^ (2.22) 

[ 0.0 if d > Cshare 

Sharing requires lowering of an individual’s payoff (fitness) due to presence of its neighbors 
in the search space. The parameter a„hare can be taken as the maximum distance between 
the strings necessary to form as many niches as there arc peaks in the multi-modal solution 
space. 


2.2.6 Phenotypic Sharing 


When the proximity of the of the individuals is defined in the decoded parameter i.e. 
real space, it is called phenotypic sharing. The distance metric {dij) is defined as the distance 
between strings in the decoded parameter space. For ap-variable function, the distance norm 
in the p-dimcnsional space is calculated by considering the Euclidean distance. 




(2.23) 


where :r, = [aq.,, T.yi, T, = [aq,,-, X2j, x J and Xp.i are the decoded 

parameters. If each niche is enclosed in a p-dimensional hypersphere of radius Ushare such 
that each sphere encloses Jth volume of the space, where q is the number of peaks in the 
solution space. The radius of the hypersphere is calculated as [19, 21] 


r- ^ 


TTa ^ {^k,max ^k,min 


y 


(2.24) 


^binary bits 
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and the volume is \' = cr^ where c is a constant. Thus Oshare can be obtained as 


p 1 p 


^share — 




So one will obtain from the above 

^ \/^k=l {^k,max ~ ^k,min) 

^share — ^ p 

2^ 

2.2.7 Genotypic Sharing 


(2.25) 


As described by Goldberg and Richardson [21], the genetic closeness of the two individ- 
uals are taken as the number of different alleles in their chromosomes. The distance metric 
djj is defined as the Hamming distance between the strings and ashare is the maximum num- 
ber of different bits allowed between the strings to form separate niches in the population. 
Comparing the two strings Si and Sj of string length 1, for allowable one bit difference the 
total number of strings genetically close are For k distinct allowable bits of difference, 
total strings arc possible in the solution space. As the total number of strings possible 
is 2‘ , then for allowable /c-bits (j[)/2^ strings are away from each other. Thus for all the 
strings in the population with k bit differences or less is given by (!)/^^- According to 
sharing, for (j-peaked function, there are q niches in the solution space. Assuming uniform 
niche placement ^ if k ( or cr^hure ) number of bits of differences allowed between the strings 
to make f/-subspaces in the solution space, then [21] 

k /1\ 1 

(2.26) 




2 ^ 1=0 ^ ' 


1 


2.3 FundamExNtals of Differential Evolution 

Differential Evolution (DB) is a simple evolution strategy that promises to make fast 
and robust numerical optimi'/ation. The overall structure of DE involves population based 


^each niche must correspond to an average of ^ of the total solution space 
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sCciiclies. The pArallol \cision of DE maintains two real valued D-dimensional arrays of pop- 
ulation number A. The pjimery enray holds the current vector population and the secondary 
one accumulates vectors that arc selected for the next generation[14]. In each generation N 
competitions are held to determine the population for the next generation. The i-th compe- 
tition picks the i-th population vector, called the target vector, and its adversary is known 
as the trial vector . The target vector that competes against the trial vector is also one of 
the trial vector’s parent. The trial vector’s other parent is a randomly chosen population 
vector to which a weighted random difference vector has been added. Mating is done be- 
tween tliis noi.sy random vector and the target vector, and is controlled by a non-uniform 
crossover operation that determines which trial vector parameters are inherited from which 
parent. Three factors those control the evolution under DE are population size(N), the 
weight applied to the random differential (F), and the constant that mediates the crossover 
operator (C R). The basic mechanism of DE has been explained in Figure 2.5. 

During initialization, the parameter limits are set and each parameter in every primary 
array vector is initialized with a uniformly distributed random value within the allowed 
range. To cietermine and preserve the cost (function value) of the initial population, each 
primary array vector is evaluated. Unless not restricted, DE can search beyond the specified 
limits. 

In order to perform mutation, DE uses an appropriately scaled perturbation of the 
population itself. Every pair of randomly chosen vectors {Xi,Xj )gives rise to a vector 
differential (-Y, - Xj ), and their weighted difference is used to mutate another vector Xk, 
such that 


= 1, -b FiXi - xff 


(2.27) 


The scaling factor F is a constant defined in the range 0 < F < 1.2, where the upper 
limit is determined empirically. The optional value of F lies in the range of 0.4 to 1.0 [14]. 
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Figure! 2.5: The biisic mechanism of Differential Evolution (Source: [14]). 

By mutating vectors, with population based differential, DE ensures that the solution space 
will be efficiently searched in each dimension. 

In every generation, each primary array vector Xi is targeted for recombination with 
a vector X'^. to produce a trial vector AV The trial vector (a noisy random vector) is the 
child of two parents and it must compete with the target vector. In order to determine the 
parental contribution for the trial vector parameters a series of D-1 binomial experiments [14] 
are done, whose outcome is either success or failure and its probability is determined by the 
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crossover constant (Ci?), where 0 < CR < 1. Starting at a randomly selected parameter, 
the source of each trial vector is determined by comparing the value of CR to a uniformly 
generated random number within the interval (0 , 1). If the random number is greater than 
CR, the trial vector gets its parameters from the target Xi , otherwise the parameters come 
from Xf. (noisy random vector). To ensure differs from A", by at least one parameter, the 
final trial vector parameter always comes from the noisy random vector, and therefore only 
D-1 binomial experiments are performed in a D variable problem. 

For selection, the cost of each trial vector is compared with its parent target vector. The 
vector with more optimal cost is allowed to the secondary array. After each primary array 
vector has been a target for mutation, recombination, and selection, the array pointers are 
swapped so that roIe.s of the two arrays are reversed. Thus, vectors in what was the secondary 
array becomes the target for transformation, while the former primary vector competes with 
the winners of the next generation. 

2.4 Fundamentals of Simulated Annealing 

The Simulated Annealing (SA) optimization algorithm is analogous to the physical pro- 
cess by which a material changes its state while minimizing its energy'. A slow cooling brings 
the material to a highly ordered, crystalline state of lowest energ}', while a rapid cooling 
yields defects inside the material. 

Considering A be a vector in i?" where {xi,X 2 Xn) are its components and f{X) 

be the function to be optimized where ai < xi < bi,.. a„ < Xn < bn be its n variables, 
each ranging in a finite, continuous interval. Although the function /(A) does not need to 
be continuous but it must be bounded. The SA algorithm for optimizing the function is 
shown schematically in Figure 2.6. It proceeds iteratively, starting from a given point Aq, 
and then generates a succession of points Aq, Ai, A^'a, ....Ai, ... leading towards the global 
optimum of the cost function. New candidate points are generated around the current point 
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Ai by Applying random mo\es along each coordinate direction. The new coordinate values 
are unifoiinl}’ distributed in the inter\als centered around the corresponding coordinate ATj. 
Half the size of these intei\als along each coordinate is recorded in the step vector If the 
point falls outside the definition domain of /, a new point is generated randomly until a point 
belonging to the domain is found. A candidate point AT is accepted or rejected according to 
the Metropolis Criteria, described below [22]. 

If A./ < 0, then accept the new point Xi+i = Al,- else accept the new point with • 
probability 

p(A/) = Exp(^) (2.28) 

where, A/ = /(A ) — /(A.) and T is temperature like parameter. At a fixed value of 
T the successive points A'o, A'l, A' 2 , ....A^, ... is not always downhill, except T = 0. When 
value of T is large compared with mean value of [/(AT/,) - /(AT.)|, where AT and AT are 
points randomly chosen inside the definition domain of /, then almost all the new points are 
accepted. SA starts with a high temperature value To and a sequence of points are generated 
until an 1 tpulihriuin is reached at that temperature. During this phase, the step vector Vm is 
periodically adjusted to better follow the function behavior. The best point reached so far is 
rccordf'd as ATp(- After thermal equilibrium is reached at Tg, the temperature is reduced and 
a new sc'cjuimce of moves are made starting from ATpt, until thermal equilibrium is reached 
again b Tlu' process is stopped at a temperature low enough that no more improvements 
are expected, according to a stopping criteria. 

S.-\ permits occasional uphill moves under the control of a temperature parameter. At a 
higher temperature only the gross behavior of the cost function is relevant to the search. As 
the lempiTaf un* decreases, finer details can be developed to get a good final point. While the 
optimality of the final point cannot be granted, the method is capable to proceed towards 

better minima even in the presence of many local minima. 

The number of function evaluations until next temperature reduction is the product of total coordinate 
directions, number of cycles in one iteration and number of iterations allowed 



2.5 The Method of Objective Weighing for More than One Objective Functions 35 


For step adjustment, according to Metropolis criteria [23], a higher number of accepted 
moves with respect to rejected ones means the function is explored with too small steps, 
and on the contrary, a higher number of rejected moves means that new trial points are 
generated too far from the current point. A [1:1] rate between accepted and rejected moves 
means that the algorithm is following the function behavior well. A step vector records 
the maximum incremental possible along each direction and is adjusted every iV^-th move 
to maintain the ratio. 

2.5 The Method of Objective Weighing for More 
THAN One Objective Functions 

For Multi-objective optimization problems the methodology of problem formulation is 
given below 

Minimize or Maximize fi{^) for i = 1,2,....,N 

Subject to 

ftl{Y) = 0.0 1 = 1,2..X J 

the parameter A" is a p dimensional vector, having p design or decision variables. 

In the Classical approach methodology the easiest way of handling such kind of problems 
is The Method of Objective Weighing, where the over-all objective function 

= El.WiUX) 

where x € A' and A'' represents the feasible search region. In the above formualtion, 

0 < lOi < 1.0, and = 1. 
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Figure 2.6: The flow chart for the Simulated Annealing Algorithm for Multi modal Function 
optimization (Source: [22]). 






Chapter 3 


Mold Design Problem 


3.1 Basic Aspects of Problem Formulation 

The Ii(}uicl steel begins to solidify as it enters the mold and forms the solidified strand 
shell. When the strand leaves the mold, the thickness varies between 10 to 30 mm, depending 
upon th(^ casting speed [1]. 

According to the product requirement of steel plant, a wide variety of cross sectional 
shapes and sizes can be continuously cast. The mold determines the cross section of a strand. 
Differiuit types of molds are required to cast different types of strand, such as rectangular, 
s(iuar(', round, hollow or polygon in cross section. Molds are generally classified into three 
categorii's [l] 

• Block molds. 

• Tubular molds. 

• Plat(' molds. 

Shaping wall of th(} molds are made of rolled or forged plates of copper alloys. These are 
bolted with a steel box which acts as cooling water jacket. The water jackets are supported 
by a steel frame on to which the gear boxes for the narrow face and the clamping mechanisms 


are mounted. 
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The steel water boxes perform a dual function [10] 

(i) They provide a stable support for the copper liners. 

(ii) Ensure even distribution of the mold cooling water over cooling surfaces. 

Mold lengths vary between 800 and 1000 mm depending on the production requirements [1]. 
The length is usually determined as a function of the casting speed. From heat transfer 
measurements, it was suggested that high casting speed requires a longer mold [8]. 

Mold oscillation is produced by means of cam- rollers. As a result, sinusoidal motion of 
the mold is produced, which is shown in Figure 3.1. 



Negative strip time = t„ = 1/f- 

Figure 3.1: The essential features of mold oscillation (Source: [1]). 

Movement of the mold prevents sticking of the solidified shell to the mold wall. Other- 
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wise, tensile stresses produced in the solidified shell leads to surface cracking of the strand. 
Parameters involved in this reciprocal motion are the frequency (F) and the stroke length 
(S). Surface quality of the product largely depends on the correct choice of these param- 
etcis. Duiing downwaid movement of the mold [negative strip'j, axial pressure is applied 
to the solidified shell, which prevents transverse cracking of the strand. Oscillation marks 
are produced from the interaction between the casting slag and the mold movement. The 
magnitude of the negative strip time was calculated by [10, 45, 46] as 



where'. Tv is the negative strip time in (sec), F is the frequency of the mold in (cycles/sec) 
and \ 'c is the casting speed in (mm/scc). Short strokes or small amplitudes at high frequency 
have been successful in minimizing the oscillation marks [ij. In order to obtain optimum 
negative strip time T,\ of 0.2 to 0.3 seconds under fluctuating casting speed, mold movement 
and casting .si>eed are synchronized [1, 45]. 

Mold fluxes are .synthetic slags used during the continuous casting of steel. Fluxes are 
continually fed on the licjuid pool surface during casting. As a result, it melts and liquid 
slag is formed. The liquid slag flows down between the mold wall and the solidified shell. 
Figure 3.2 scliematically shows the general disposition of a flux in the continuous casting 
mold [26, 27]. 

The flux above the surface of the molten steel consists of three different layers [44]. 

(i) An imimdted, dark, unreacted powder layer on top. 

(ii) A h(*terogeneous sintered layer in the middle. 

(iii) A molten flux layer over the liquid steel. 

A rim is formed by flux in contact with the water-cooled copper mold walls at the 
meniscus level, which has a considerable influence on heat transfer process at the meniscus 
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Figurf' 3.2: Tlie schematic representation of flux behavior in a continuous casting mold 
(Source: [26]) 


Below the meniscus, flux layer consists of a solid film directly in contact with mold walls 
and a liciuid film in contact with the strand surface [26, 28]. Mold fluxes are used for several 
purp<Lses [44]. 

(i) proviile thermal insulation. 

(ii) prott'ct the liquid steel against re-oxidation. 

(iii) absorb the nou-rnetallic inclusions. 

(iv) lubricate the strand and reducing strand friction. 

(v) control the heat transfer between strand and mold. 
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Ogibayat.lii et al. [34] investigated the melting behavior of prefused type, granulated 
powders for different viscosity range and vitrification ratio between 25 and 90%. The vit- 
rification ratio was defined as the percentage of molten flux formed when 25 gm of powder 
were unidircctionally heated at 1400°C for seven minutes in an alumina crucible [44]. After 
ncating, the tiucible was cooled and cut through the center and the height or % of liquid 
flux is taken as an index of melting rate. The percentage of liquid flux formed was termed 
as vitrification rate by the researchers at Nippon Steel [31, 35]. When the vitrification rate 
exceeds 80%, the flux tends to form a flux rim which causes partial solidification of liquid 
steel surfac'c in a slab caster [44]. When vitrification rate is too low, the longitudinal cracks 
are fornu’d. In order to prevent such defects, Ogibayashi et al. [34] established an optimum 
flux pool (h'pth of approximately 10 mm corresponding to an optimum vitrification rate of 
about C0/{.'. 

Once the flux has been added to the mold, tw'o major phenomena occur [44] 

(i) mold powder is heated in contact with the liquid metal, melts and feeds the surface 
with a liquid phase. 

(ii) the li<}uid phase spreads over the steel surface and infiltrates into the gap betw'een 
strand and mold during oscillation cycle. 

Though the infiltrated flux solidifies in contact with water-cooled copper mold wall and 
forms a solid layer, a molten flux layer remains which ensures lubrication of the strand-mold 
interface; [35]. The molten flux pool is formed above the meniscus of steel which acts as a 
reservoir and id(>ally provides a continuous supply of liquid flux to the mold-strand gap. It 
is important that this flux-pool has sufficient depth to feed the gap adequately all the times 
[36]. Mold powder consumption rate is related to the thickness of the liquid film, as well as to 
the depth of oscillation marks. The powder consumption rate decreases with the increasing 
casting speed and a reduction in oscillation frequency and negative strip time causes the 
increase in powder consumption rate [32, 36]. 
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Nakano ct.al [3o] also showed that the agitation on the liquid steel surface accelerated 
the melting of mold fluxes and increase the thickness of flux pool. A flux pool thickness in 
the 10 to 15 mm range has been recommended as an optimum value [13, 29, 33, 34, 38]. For 
higher casting speed Mills recommended a minimum thickness of 20 mm. Bommaraju [30] 
recommended a minimum thickness of C to 12 mm, which is in the same range reported by 
Isai and Mastenich [37]. Kakano et al. [35] suggested a minimum flux pool depth (Ip) is 
given by 


yp = Ssin 



— —2 + 5 


(3.2) 


\sheie S is stroke length of mold oscillation in (mm), F is the frequency of oscillation in 
(cych's/min), Ic is the casting speed in (m/min), 6 is the molten surface oscillation in (mm) 
and A' is the negative strip ratio in (sec), which is given by 


A' = 1 - 



(3.3) 


Koyama et al. [31] established the following empirical equation relating the average flux 
pool thickness with vitrification rate, mold dimensions, casting speed, and powder consump- 
tion rate. 


0.02 f 


\abwV 


(3.4) 


where, d is the flux pool depth in (mm), Sji is the vitrification ratio in (%), a and b are 
the transverse mold dimensions in (m), V is the casting speed in (m/min), and w is the 
con.sum[)tiua rate in (kg/t). 

Heat transfer in the mold is one of the most important phenomena taking place during 
the continuous ca.sting of steel. In order to get slabs with good surface quality, proper 
heat extraction is iH’cessary. If the rate of heat removal is excc.ssivc and uneven, thermally 
induced str(.‘.s.scs are generated in the solidified shell. As a result, longitudinal cracks may 
occur. Insufficient heat removal can lead to a thin solidified shell prone to bulging or break- 
outs [39, 40, 41]. 

From the iicjuid solid-steel interface heat is transfered to the mold by a sequence of steps 
consisting of [44]. 


(i) Onu'cctiou in the liquid steel pool. 
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(ii) Conduction through the solidified shell. 

(iii) Heat transfer across the flux layer infiltrating the strand-mold gap. 

(iv) Heat transfer across any air gap separating the strand and the mold. 

(v) Conduction through the mold walls. 

(vi) Convection at the mold-cooling water interface. 

The largest resistance to heat extraction is encountered between the solid shell and 
mold wall [39, 40, 43]. Therefore the mold heat transfer is controlled by those factors which 
determine tlu' thickness of the gap separating the solidified shell and the copper mold wall 
and the properties of the flux infiltrating the gap. 

Tlu' basic aim of the continuous casting mold design problem cosidered in this study is 
to maximixi' the spinnl of casting while keeping the various operational variables within some 
recpiirt'd rangi'. This basic goal can be achieved by formulating the continuous casting mold 
design problem as an ojidmization problem. In the following three different formulations of 
the mold dt'sign are jjre.sented. 

3.1.1 First For.mulation 

Considering the empirical equations given by Nakano et.al [35] and Koyama et.al [31], 
the first objective function is formulated here as follows. 

Maximize. 

0.00157s^/sOT(7rA^) 

where, \ ' is the casting velocity. We formulated the constraints from a physical consideration 
of the mold behavior. The constraints being considered are as follows: 


«(!) = ( 7 ) - 0.99 > 0.0 

(3.6) 

g(2) = I.OOl - (^) > 0.0 

(3.7) 


where, d is the flux pool depth in (mm), Yp is the minimum liquid pool depth in (mm), s is 
the stroke length in (mm), / is the frequency of oscillation in (cpm) , S is the molten steel 
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surface oscillation in (mm), N is the negative strip time in (sec), Sr is the vitrification ratio 
in (%) and tv is the consumption ratio in (Kg/ton). The parameter (ab) is the size of the 
billet being castcd (in sq.m.) 

Hovexoi, in this formualtion the basic mechanisms of heat transfer can not be consid- 
ered. This is not good enough to study the solidification process takes place in the primary 
cooling region of a continuous caster. Hence, in order to incorporate the heat extraction 
mechanisms into the problem description the second formulation is done. The formulation 
is presmitcd in the next section. 

3.1.2 Si':coN'D Formulation 

The ('onfigiiration of the mold region is shown schematically in Figure 3.3. Heat balance 
has h(’en done for this configuration following the approach of Geiger and Poirier [49], in order 
to get the governing eciuation for the casting speed in terms of various operational parameters 
as shown below. For a considerable ranges of solidified shell formation the casting speed is 
optimi/.t'd in this study. Furthermore, for a very thin solidified shell thickness the strand 
may bulge out and a very high solidified shell will reduce the casting speed making the caster 
an inefheient one. 

While in the mold, the conduction of heat in the thin shell of solid is much greater in 
the x-<lirection than in the y-direction. Therefore the conduction of heat in the withdrawal 
direi'tion may he iguon'd. .»\.s the metal passes through the mold, it can be assumed that a 
const am heat Iransfc'r cm.-fficient applies to account for the mold-casting interface resistance, 
as lu'at is removed by the water-cooled mold maintained at To. 

The interfacial resistance predominates over the resistance offered by the solidifying 
metal (4£)].'rhf^ temi)eiature gradients within the mold and the casting are negligible. Thus 
the total amount of he;it Q that transfers the mold-casting interface in time t is 

Q = hA{Z-To)t (3.8) 

If the temperature gradients are negligible, then Ts — Tm, and only latent heat is removed 
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Figun* .‘5.3: ( 'onli.i’.uralion of the mold used in the second objective function formulation 


from the casting during solidification. As Q = p'vHj , then we can obtain 

V HTm-Tq) 

A " p'lff 


(3.9) 


where // is tlu* density of the solidifying steel, is the volume of steel solidified, A is the 
c,ross-S(‘ctional art'a and /// is the latent heat of fusion and h is the heat-transfer coefiScient. 
As t.lu‘ shtipe has no efhKd., Eciuation (3.9) can be applied for the uni-directional solidification 


in tin; form 


M = 


h(TM-Z)^ 


(3.10) 


wher<; M is the iJdekness solidifierl. Now, considering the case Ts^Tm, and heat leaves 
the casting by t’onvection at the surface to a water-cooled mold maintained at T^, we can 
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calculate the heat flux at the casting-mold interface as 


^li=0 




(3.11) 


9l„o = '>W-r„) (3.12) 

Xow , ci)iiihining the' Eciiiations (3.11) and (3.12) and eliminating the surface temperature T 5 , 
and assuming the liiuMi temperature profile, we obtain the flux at the solid-liquid interface 

as 

I , — Tq 

9la:=0 ^lx=A/ l , A/ (3.13) 

A + F 

Also, at X = J/, the latent heat is evolved such that, 

. / dhl 

4=m = P^/-^ (3.14) 


By ctuuhining tlu* two equations and integrating with M = 
w(' obtain [49], 


M = 


hXu - %) 
pHj 



0 at t = 0 and M =■ M at t = t 

(3.15) 


.Acctirding to .‘Idanis [49], a more refined analysis yields 

- pH, a ‘ " 5 ?'^^ 

where the parameter a can bt* defined as 


(3.16) 


_1 1 . CpiT„~Z) 

“ ” 2 4 3H', 


(3.17) 


Now, defining t as the di.stance down the mold wall L, divided by the casting velocity U, 
we obttiin the* .sc'cond obj(‘Ctive function 


2k'h'{TM-To)L 
p'H}a{2k'M + M^h) 


(3.18) 


where, M Is the solidified shell thickness , L is the mold length , To and Tm are the 
ttunperatures shown in Figure 3.3 , h is the heat transfer co-efficient at the mold-solid 
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, k is the thermal conductivity of the solidifying metal and p is the density of the 
solidifying metal . 

Assuming that the liciuid temperature is uniform with mold length at Tp, and at the 
solid-liciuid interface, latent heat plus liquid superheat is conducted towards the mold wall 
through the solid skin, we can define the effective latent heat of fusion H) as, 

Iif = Hf + Cpi{Tp-T^{) (3.19) 

wlu'K’, /// is tlu' latent heat of fusion, Tp is the pouring temperature of the liquid metal, 
Cpi, is the specific heat. 

Th(' ()h.jective function u.sed here : 

CASTING VELOCITY = w,U + W 2 V (3.20) 

when*, 1. and \ varit's within a very .small range, so the another constrained used here is 

Si<\{U-V )\<62 (3.21) 

whi-re, ri'j and Ah an* varying within a very small range. 

'fhe simplified foimulation. a.s presented here provides a reasonable approximation of 
the mold <lesign pruhlem [3|. However, for a better description of the mold problem we have 
to ciuisider a number of additional factors [3]. The additional factors, considered in the 
formulation, are jire.^ented in the next section. 

3.1.3 TTiihi) For.\ii:l.4TIO.\ 

Following Brim.'icomhe anil .Sarnarasekera [51] a more reliable and precise mold config- 
uration is .si'heinatirally shown in Figure 3.4. 

'I'he nmshy that forms in a continuous caster retains a portion of the latent heat 
affecting tlu’ the tmipiuature profile in the mold region. Among other factors cooling water 
velocity, thernu'd conductivity of the mold-powder that fills the gap between the innerwall 
of th<‘ mold ;md the outer surface of the casting have a crucial role in the mechanism of 
heat transfer in contiiiuou.s casting [3], In order to make a precise approximation, we have 
formulatetl th<* casting speed in the following fashion. 

2k'ih + 6i){TM-To)L 

p' Hfa[2k' M + AP(h + (5i)] 


(3.22) 
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Figure 3.4; The configuration of the mold the third objective function formulation 


wlu're h is the modified heat transfer coefficient for the gap region, (5i is the heat transfer 
coefficient correction factor. The available latent heat of fusion was adjusted through a 
mushy zone-correction factor e, 

Hf = {I - e)[Hf + Cpl{Tp -Tm)] (3.23) 

Following the approach adopted by Geiger and Poirier [49], we can consider an imaginary 
ri’fi'rcnice between the mold and the casting, which is described in the Figure 3.4 as T^, where 
T). is constant. The contact resistance is then approximated on both sides of the mold-metal 
interface as 


h = 


hjw + he 


2 


(3.24) 
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where, 


and 


/ kpCp . 
ik'p'cy 

(3.25) 

k'pCp 
\ kpep 

(3.26) 


he — h[l + 

where, k is the thermal conductivity, p is the density, Cp is the specific heat respectively. 
The primed quantities are on the steel side while the rests are on the copper mold side. As 
the value of h is only approximate, thus a correction factor 5i was included. The parameter 
‘a’ is defined in the following way 


0=2 + 






4 31(1-£:)H;1 

Using these expressions the objective function here is formulated as 

CASTING VELOCITY =wiU + W2V 

where U is as given in Equation 3.22 and V is as given in Equation 3.5. 


(3.27) 


(3.28) 


3.1.4 The Constraints For The Third Formulation 

The first constraint formulated here is 


Si< {U- V) 


< h 


(3.29) 


where, and 62 are varying witliin a very small range. 

By equating the average mold heat-fiux expression (provided by Brimacombe [51]) with 
heat-flux at the mold-metal interface, the first physical constraint is formulated as [3, 2] 


2675.2 - 334.4 - (h -b <5i)(r5 - r„) 


(3.30) 


where the time was scaled as ^ and the original value of the constants is converted to SI 
unit. Assuming the mold-gap thickness [Ag) filled with mold powder of thermal conductivity 
km, we calculate the second constraint as [3] 

km + ^2 


^9 


(/i -b ^l) — 0 


(3.31) 
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Due to significant uncertainty about km a correction factor 62 is added to it. 

The third equality constraint was formulated by conducting heat-balance at the cooling 
water side [3] 

hwk{To — Twout) — {pvjCpw^iTwout — Twm) — P)ik + h^Atmoid) = 0 (3.32) 


Heat transfer between the mold wall and the cooling water takes place by forced convection 
which has been accounted for by the help of a heat transfer coefficient (h^) determined using 
the following correlation [47, 48]. 


K = 0 . 023 ^ 

where, the various temperature terms are shown in the Figure 3.4. (j) denotes the cooling 
water velocity (m/sec), Atmoid is the thickness of the mold, (5 denotes the heat loss adjustment 
factor, /z is the viscosity, D// is the hydraulic diameter, and parameters on the cooling 
waterside are denoted with a subscript w. 

In the final formulation the equality constraints are combined together to form 




Pvj 


(3.33) 


2675.2 - 334.4 



{km + <^ 2 ) 
Aghujk 


[hujk{Ts — Twout) — [pwC pw<i>{Twout — Twin) ~ P) 


{k + h.ujAtmoid)] — 0 (3.34) 

A total of eighteen important operational parameters including the correction factors have 
been optimi2ed by using different optimization algorithms, details of which are described in 
the subsequent chapter. The values of the constants along with references and parameter 
ranges arc given in Tables (A.1-A.4) Appendix A. 
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Computational Aspects 

In order to get the optimum values of the operational parameters for the multi-variable 
objective functions used in this study, several optimization approaches have been carried 
out. These approaches are briefly described below. 

4.1 Optimization using Steepest Descent Technique 

Penalty function method coupled with a steepest descent algorithm was applied first to 
get the global optimum values. The objective function given in Equation(3.22) has been tried 
out by this approach first. The equality constraint considered in the formulation given in 
Eciuatioii(3.34), has been transformed into two inequality constraints varying within a very 
short acc{!ptable range. Thus, the equality constraint h{X) = 0, gives rise two inequality 
constraints 

^i(A)=/i(X)-5i>0.0 (4.1) 

92 {X) = 62 - h[X) > 0.0 (4.2) 

where. Si and ^2 are varying within a small range and S 2 > ^i- Other inequality con- 
straints have been formulated considering the upper and lower bound of the design variables. 
In order to obtain same contribution for all the inequality constraints in the penalty term, 
and to make the search more efficient in the hyper-dimensional search space, the normaliza- 
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tion procedure has been adopted. So, the formulation of the penalty term in Equation(2.2), 
given by the following equation 

- 1-0 > 0.0 (4.3) 

9j^max ) 

In this study the exterior penalty parameter approach had been adopted to handle the 
inequality constraints, while the optimization was carried out by using the Steepest Descent 
search procedure. 

The multi-modality and differentiability of the objective function used here can be 
determined from Figures 4. 1-4.4. The three dimensional contour plots are drawn by varying 
two parameters within the specified ranges along x and y axes and plotting the corresponding 
objective function value, (i.e. the casting speed), along z axis. For all the plots, dimension of 
the Ciist billet was taken as 150 sq.mm. The values for the flux pool depth (Fj,), liquid pool 
depth (d), stroke length of the mold (s), molten steel surface oscillation (d), vitrification ratio 
{Sr), mold-flux consumption rate (ic) and temperature at the inner surface of the mold {To) 
were kept constant at 15mm, 15mm, 20mm, 10mm, 65%, 0.9 kg/ton, 450 /il', respectively, for 
all the figures. 

From the figures, it is evident that the objective function behaves unimodally with the 
variation of the solidified shell thickness (m), frequency of mold oscillation (/), and negative 
strip time {N). However, there is a highly non-linear interaction and multi-modality of 
the objective function in the hyper-dimensional search space with respect to the different 
correction factors such as 5 1 , £ etc. 

It is rather well known that the gradient based traditional methods are very much 
.sensitive to proper initial guess vector and also dependent upon the upgradation technique 
of the penalty parameters. It is obvious from the figures which are describing the gross 
fuiK'tion behavior, that the steepest descent method should be unable to converge properly at 
the global optimum satisfying all the constraints. Thus, during the actual calculations it was 
not possible for this search procedure to optimize all the eighteen parameters simultaneously 
for the multi-variable objective functions used in this study. Therefore, the objective function 
given in Equation (3.28) could not be used with the steepest descent algorithm, a simplified 
problem with a total of eight variable described in Equation (3.5) was used instead. 
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Figure 4.2: Variation of the objective function value i.e. casting speed (m/min) with respect 
to negative strip time N and solidified shell thickness M 


the population. The single point crossover with crossover probability pc (preassigned value 
between 0.7 and 0.9) has been found to do well rather than uniform crossover. Jump muta- 
tion and creep mutation operators have been employed with low mutation probabilities pm 
and Pererp, usuallv taken in the range of 0.001 to 0.01. To handle the constraints, the pe- 
nalized 0-function has been considered as the fitness function during the genetic operations. 
The penalty parameter R is adjusted properly based upon the value of penalty terms. The 
number of inequality constraints which has been considered is also less, because in GA the 
variable bounds are automatically kept fixed while employing the linear mapping scheme. 
The objective function has been penalized heavily while the constraints are violated. 

In this problem there seems to be pronounced tendency of the objective function, i.e. 
the casting speed, to go into the negative regime, which has no physical significance. In 
order to overcome this problem, we have assigned a very low fitness value { O(10~®)} of 
such individuals, while performing the GA calculations. We have also tried out to find the 
convergence using p-GA, with the options for elitist selection and uniform crossover. But 
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Figure 4.3: Variation of the objective function value i.e. casting speed (m/min) with respect 
to mold wall temperature To and correction factor e 

the performance of SGA appears to be much better than micro-GA, where the domain for 
this stationary objective function is precisely defined. 

To optimize all the eighteen parameters, genetic algorithms required a population size 
(iV) of 450 individuals and was able to converge after several thousands of generations making 
the whole process highly computation intensive. 

4.3 Optimization using Differential Evolution . 

In this study the simpler real valued approach of differential evolution (DE) has also 
been applied to find the global optimum. Differential evolution has a tendency to search 
beyond the search-space which has been restricted by fixing the upper and lower bounds 
for different parameters, by perturbing the population vector generations after generations. 
Among different strategies of crossover used in DE, the binary crossover strategy has been 
found to perform satisfactorily. A modular division has been employed for selecting the 
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Figure 4.4; Variation of the objective function value i.e. casting speed (m/min) with respect 
to correction factor s and correction factor 


parameter value to be taken directly from target to trial vector for every design vector in the 
population. After thousands of generations DE also has reached the near optimal solution 
for all the eighteen parameters in the multi-objective function formulation described by 
E(iuation(3.28). 

4.4 Optimization using Simulated Annealing 

Simulated Annealing^ one of the adaptive non-traditional search procedure, has also been 
employed here to find the global optimum for the multi-variable objective function given in 
E(luation(3.2S). The initial temperature {To) has been taken to a very high value such as 
500 A'. To make the search procedure more robust in every coordinate directions of the hyper- 
dimemsional search space, the number of iterations before temperature reduction (Nt) has 
been taken as 100 and number of cycles (iV,) performed in every iteration is taken as 20. 
As a result, before any temperature reduction the number of function evaluation required is 



4-5 Computation Environment 


57 


36000. Temperature is reduced further after thermal equilibrium has been reached at any 
particular temperature. The search is carried out until all the parameters finely converged 
to global optimum. 

4.5 Computation Environment 

In order to satisfy the requirements of the highly computation intensive nature of this 
problem a number of machines were used in this study. Some tasks were done by using the 
linux version of FORTRAN 77 (f2c)compiler under Linux environment in pentium machines 
and rest of the job is done using /77 -MIPS and MIPSpro F77 compiler in Silicon Graphics 
workstation with central processing units : MIPS RlOOOO, Rev 2.6 under IRIX64 Release6.4 
operating system. In both the cases, it took the cpu time 18000.00 sec on an average for a 
single run until converged, where all the eighteen decision variables have been considered to 
form the objective function. 



Chapter 5 


Results and Discussion 


5.1 Results Obtained by Steepest Descent Method 

The results obtained by using steepest descent method are given in Table A. 6. It is 
rather well known that the gradient based traditional methods are very much sensitive to 
proper initial guess vector and also dependent upon the upgradation technique of the penalty 
parameters. Hence, steepest descent method is unable to converge properly to the global opti- 
mum for a non-linear, multi-modal objective function like Equation (3.28) used in this study. 
In this case it was not possible to optimize all the operational parameters simultaneously 
using the steepest descent search procedure. 

5.2 Results Obtained by Genetic Algorithm 

In this study genetic algorithms was successfully applied to all the three formulations 
d(>scrib(Kl in Chapter 3. While operating with genetic algorithms, the task of optimization 
has become much easier compared to steepest descent method. 

Using GA, we first tried to find out the optimum values for design parameters related 
to the oscillatory mold. The results obtained using SGA for the first formulation (Equation 
(3.5)) are given in Table A.5. In order to find out the relational behavior of the decision 
parameters with casting speed, we have varied the variable ranges in a predefined way. 
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Figure 5.1; Optimum casting speed as a function of the variation of stroke length of the 
mold. 

From Figures 5.1 and 5.2 we can easily predict that the casting speed increases with 
increase in stroke length and decrease in vitrification ratio of mold flux in oscillatory mold. 
The optimum values for different parameters obtained using the objective function given in 
Equation 3.28 are given in Tables A. 8 through A. 10. 

5.3 Results Obtained by Differential Evolution and 
Simulated Annealing 

It took differential evolution atleast 1500 generations to reach the near optimal solu- 
tion wlien all the eighteen parameters in the multi-variable objective function formulation 
described in Equation (3.28) were considered. Often it was necessary to run upto 2500 gen- 
erations to obtain the proper convergence. Different optimum parameter values computed 
u.sing differential evolution are enlisted in Tables A.ll through A. 13. 
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Figure 5.2: Optimum casting speed as a function of the variation vitrification ratio of the 
mold. 

All the eighteen optimal decision parameter values obtained by simulated annealing 
itself, arc given in Tables A. 14 through A.16. 

5.4 Comparison of Performances of Different Op- 
timization Methods 

The total number of function evaluations required in simulated annealing is 72,00,000 
on an average for every single run until convergence. For genetic algorithms the correspond- 
ing number is much less (approximately 6,75,000) which is also comparable with function 
evaluations required in differential evolution. 

In order to compare the performance of all these optimization procedures adopted here, 
the following figures (Figures 5. 3-5.8) have been plotted, and they are also compared with 
the industrial data [51] simultaneously. The optimum casting speed has been calculated by 
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Figure 5.3: Optimum casting speed as a function of the variation of solidified shell thickness 
in different optimization procedures. 


thes(' methods for different dimension of the cast billets. 

From the figures, it is clear that the predictions made by genetic algorithms for the 
optimum value of casting speed as a function of different operational parameters are more 
satisfactory compared to differential evolution, and simulated annealing. The results obtained 
by simulated annealing for the optimal casting speed with the variation of solidified shell 
thickness are physically infeasible showing no variation in casting speed. The results obtained 
by traditional steepest descent method are not fully converged, and the values of the equality 
constraint as well as the optimum casting speed are so large that they are not considered 
whil(> drawing the plots. Also the optimum casting speed predicted by this method is so 
high that it appears practically infeasible. The variation of casting speed with negative 
strip time is also considered. Simulated annealing gives much higher values than the normal 
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Figure 5.4; Optimum casting speed as a function of the variation of solidified shell thickness 
in different optimization procedures. 


industrial practice, whereas genetic algorithms and differential evolution predicts the mean 
approximately. It can be concluded therefore that genetic algorithms itself work better than 
all the other optimization procedures. 

5.5 The Combined Optimization Approach based on 
GA AND SA 

In the plots, the results obtained by simulated annealing are quite interesting, because 
tlu’ value of the equality constraint (Equation (3.34)) obtained there almost tends to zero 
{0(10“®)}, and simultaneously all the inequality constraints are satisfied, whereas genetic 
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Figure 5.5: Optimum casting speed as a function of the variation of solidified shell thickness 
in different optimization procedures. 


algorithms itself is unable to satisfy constraints to that level. Therefore, in order to obtain 
the global optimum values of the casting speed with the variation of other design variables 
we have used genetic algorithms to reach the near optimal point and for finer convergence 
simulated annealing is used with a low initial temperature (To). The results obtained by the 
optimization scheme adopted here are given in Tables A.17 through A.19. 

Tim results which are obtained by coupling SGA and SA are almost comparable with 
the industrial practice. The velocity profile that has been obtained with the variation of 
solidified shell thickness properly matches with the normal operational trends [51]. The 
cooling water flow rate that has been taken in the calculation is approximated form the 
value given by Gieger and Poirer [49]. 

The dependence of casting speed on liquid pool depth (Ip) and flux pool depth (D) is 
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Figure 5.6: Optimum casting speed as a function of the variation of negative strip time in 
different optimization procedures. 

alsf) shown in the Figure 5.9, from which it can be concluded that the casting speed does not 
vary so much with these parameters in the range. However, there is a significant effect of 
stroke length (5) of the oscillatory mold on the casting speed which is shown in Figure 5.11. 
The casting speed increases proportionately with the increase in stroke length. The casting 
.spei'd varic's inversely with the vitrification ratio {Sr). The casting speed also increases with 
an increase in frequency of oscillation, but after a certain range the casting speed almost 
becomi? a constant. In the range of parameters considered in this study, the contribution of 
the term in the denominator of Equation (3.5) appears to be significant. The values 

of the optimized velocity therefore goes up with increasing billet dimension. All these trends 
are shown in Figures 5.9-5.12. 

The negative strip time {N) shows a typical relationship with casting speed. Initially 
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Figure 5.7: Optimum casting speed as a function of the variation of negative strip time in 
different optimization procedures. 

th(! ca.sting speed increases with the negative strip time, but with further increase in N the 
casting speed decreases. It can be concluded that the solidified shell thickness, vitrification 
ratio, negative strip time, stroke length are the major important parameters in controlling 
the casting speed, which almost remains constant for a reasonable variation of surface tem- 
perature Ts between 1450 to 1500 K and inner mold wall temperature To between 400 to 
450 A', because of the other parameter settings. 
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Figure 5.8: Optimum casting speed as a function of the variation of negative strip time in 
different optimization procedures. 
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Figure 5.9; Optimum casting speed as a function of the variation of flux pool depth and liquid 
pool depth, when the dimension of the cast billet considered as 100 sq.mm., 125 sq.mm., 
and 150 sq.mm. 
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Figure 5.10: Optimum casting speed as a function of the variation oscillatory mold frequency, 
when the dimension of the cast billet considered as 100 sq.mm., 125 sq.mm., and 150 sq.mm. 
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Figure 5.11; Optimum casting speed as a function of the variation oscillatory mold frequency, 
when the dimension of the cast billet considered as 100 sq.mm., 125 sq.mm., and 150 sq.mm. 
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Figure 5.12: Optimum casting speed as a function of the variation oscillatory mold frequency, 
when the dimension of the cast billet considered as 100 sq.mm., 125 sq.mm., and 150 sq.mm. 



Chapter 6 


Conclusions 


A large number of continuous casting process variables have been rigorously optimized 
in this study to yield the optimum casting speed which has not been attempted before for 
this process. The relational behavior of optimum casting speed with different operational 
parameters of billet casting obtained in this study matches well with the industrial practice. 
The optimum casting speed generally increases with the increase in stroke length and de- 
creases with the increase in vitrification ratio. The effect of liquid pool depth and flux pool 
depth on the optimum casting speed docs not appear to be very significant. For difPerent 
dimensions of billets considered in this study, the basic trends of casting speed variation 
remain the same. The results of this study indicates that the casting speed increases with an 
initial increase in negative strip time, but later it decreases when the strip time is sufficiently 
high. The casting speed decreases rapidly with the increase in solidified shell thickness, but 
eventually the rate of decrease slows down. The optimum casting speed varies proportion- 
ately with the frequency of mold oscillation for every range of billet dimensions considered in 
this study. It can be concluded that the solidified shell thickness, negative strip time, stroke 
length of mold are the major factors influencing the casting speed. 

Among different optimization procedures adopted in this study, genetic algorithms are 
found to be the most robust. From the results it appears that simulated annealing has 
predicted higher casting velocity than the other optimization methods for certain operational 
conditions. However in those cases, though the value of the equality constraint almost tends 
to zero, some correction factors api)licd in this model are not properly satisfied within their 
variable boumls, rendering those high casting velocities infeasible. On the other hand, for 




all t.lic oi)f!i'a(.ing coiitlilions genetic algoriitirns arc aljle t,o prccHcL the near opt.iina! values 
comparable with the industrial practice in most of the cases. Furthermore, its efficiencies in 
terms of the number of function evaluations and search power are found to be much better 
tlian all the other optimization techniques used here. 

In order to handle more than one objective functions, the method of objective vjeighing 
is not very efficient compared to Non- dominated Sorting Genetic Algorithms with a Pareto 
optimal formulation. Also, the code needs to be parallelized in order to lower the computation 
time, which can be undertaken during the subsequent ])hascs of this research work. 

For a further detailed analysis of the continuous casting mold, the model used in this 
study needs to be refined further, considering the non-linear temperature profiles within 
and outside the cast billet, the gap between the strand and the mold wall etc. In order to 
understand the heat transfer mechanism in details, it is necessary to consider that the gap 
between the mold wall and the strand which is infiltrated with the crystalline solid mold 
powder as well as molten flux. The viscosity of the molten flux pool has also a significant 
effect on the casting speed, which varies with temperature distribution in the mold. 

In addition to the eighteen parameters considering in this study, few more may be nec- 
essary to i)rcdict the operational variables accurately. Some of the important j)aramctcrs 
which arc not being included here are shape of mold taper, the corner radius of the mold, 
composition of the steel being cast, different mold flux composition under different opera- 
tional conditions, and the variation of kinematic viscosity of the mold flux with temperature- 
all of which need to be considered in a further refined model. The steady-state heat extrac- 
tion pattern in the mold which is considered to be linear in this study may not be the proper 
approximation of the true heat extraction pattern. Because the gap between the mold wall 
and solidified strand is infiltrated with a layer solid flux film and liquid flux film, whose 
thicknesses are varying continuously through out the operation resulting in a non-linear 
temperature profile. The convective heat transfer due to the downward motion of the strand 
is not very preciously considered. Due to the corner radius of the mold the heat transfer 
may not be uniform and even all through out the mold, which also needs to be considered 
in the formulation of the objective functions. Finally the mold shape and number of tapers 
can ahvays be varied from one operational condition to another, adding further challenges 
to a rigorous optimization of the process. 



Appendix A 


Tables 


Table A.l: The value of the constants 


PARAMETER 

VALUE 

REF. NO. 

k' 

49e-03 {KJm-^s-^K-^) 

[3, 51] 

k 

374e-03 {KJm-^s-^K-^) 

[3, 51] 

km 

2e-03 {KJm-h-^K~'^) 

[3, 51] 

kyj 

6.34e-03 {KJm-h-^K~^) 

■HfelilMI 

h 

1.42 (KJm-^s-^K-^) 

[3, 49] 

c. 

0.670 {KJKg-^K-^) 

[3, 49] 

CPL 

0.754 {KJKg-^K-^) 

[3, 49] 

Cp 

0.4187 {KJKg-^K-^) 

[3, 51] 

Cpw 

4.174 (KJKg-'^K-^) 


Hf 

0.2675 (KJKg-^) 

[3, 49] 

f^W 

6.524e-04 {Kgm~^s~^) 


Ag 

0.2e-03 (m) 

[3, 51] 

^^mold 

35e-03 (m) 

[3, 51] 

W 

lOOe-03 (m) 

[3, 51] 

t 

P 

7860 (Kgm-^) 

[3, 51] 

P 

8940 {Kgrrr^) 

[3, 51] 

Pvj 

992.2 {Kgm-^) 


Tm 

1772 (RT) 

[3, 49] 

Tp 

1822 [k) 

[3. 49] 







Table A.2: The variable bounds for the first objective function 


VARIABLES 

LOWER BOUND 

UPPER BOUND 

Yp (mm) 

10.0 

20.0 

D (mm) 

10.0 

20.0 

S (mm) 

5.0 

30.0 

F (cpm) 

15.0 

140.0 

5 (mm) 

4.0 

12.0 

N (sec) 

0.05 

0.6 

Sr (% ) 

50.0 

75.0 

w (Kgton~^) 

0.2 

0.9 


Table A. 3: The variable bounds for the second objective function 


VARIABLES 

LOWER BOUND 

UPPER BOUND 

Tu- fo (K) 

800.0 

1600 

L (m) 

0.8 

1.2 

M (m) 

0.008 

0.011 

M (m) 

0.011 

0.014 

M (m) 

0.014 

0.017 

M (m) 

0.017 

0.020 

M (m) 

0.020 

0.023 

M (m) 

0.023 

0.026 

M (m) 

0.026 

0.030 

M (m) 

0.030 

0.033 

M (m) 

0.033 

0.036 

M (m) 

0.036 

0.040 

M (m) 

0.040 

, 0.045 






75 
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Table A. 5: Results obtained for first objective function by GA(the dimension of billet 
loOsq.mm.) 


RUN NO. 
# 

Fp 

mm 

D 

mm 

S 

mm 

F 

cpm 

5 

mm 

N 

sec 

Sr 

% 

w 

(Kg/ton) 


1 

14.545 

14.70 

22.458 

119.19 

10.278 

.370 

56.79 

.858 

8.273 

2 

15.644 

15.81 

24.548 

114.73 

7.968 

.382 

50.81 

.798 

7.434 

3 

19.391 

19.69 

24.661 

118.87 

10.965 

.304 

67.90 

.756 

6.815 

4 

17.27 

17.09 

8.5505 

58.249 

11.856 

.379 

67.20 

.826 

.323 

5 

17.27 

17.09 

13.551 

58.249 

11.856 

.379 

67.20 

.826 

.8132 

6 

17.27 

17.09 

18.551 

58.249 

11.856 

.379 

67.20 

.826 

1.524 

7 

17.27 

17.09 

23.551 

58.249 

11.856 

.379 

67.20 

.826 

2.456 

8 

16.784 

16.60 

29.968 

56.267 

9.3969 

.239 

55.20 

.898 

3.770 

9 

17.27 

17.09 

19.78 

58.249 

11.856 

.379 

53.44 

.826 

2.031 

10 

17.27 

17.09 

19.78 

58.249 

11.856 

.379 

58.44 

.826 

1.911 

11 

17.27 

17.09 

19.78 

58.249 

11.856 

.379 

63.44 

.826 

1.805 

12 

17.27 

17.09 

19.78 

58.249 

11.856 

.379 

68.42 

.826 

1.710 

13 

17.27 

17.09 

19.78 

58.249 

11.856 

.379 

73.42 

.826 

1.62 

14 

16.092 

15.92 

23.884 

23.267 

11.801 

.328 

54.58 

.778 

.560 

15 

16.092 

15.92 

23.884 

38.267 

11.801 

.328 

54.58 

.778 

1.394 

16 

16.092 

15.92 

23.884 

53.267 

11.801 

.328 

54.58 

.778 

2.503 

17 

16.092 

15.92 

23.884 

68.267 

11.801 

.328 

54.58 

.778 

3.828 

18 

16.092 

15.92 

23.884 

83.267 

11.801 

.328 

54.58 

.778 

5.329 

19 

16.092 

15.92 

23.884 

98.267 

11.801 

.328 

54.58 

.778 

6.974 

20 

16.092 

15.92 

23.884 

113.26 

11.801 

.328 

54.58 

.778 

8.739 

21 

16.092 

15.92 

23.884 

46.535 

11.801 

.176 

54.58 

.778 

1.873 

22 

16.092 

15.88 

23.991 

54.035 

11.301 

.276 

61.76 

.894 

2.525 

23 

16.092 

15.88 

23.919 

54.035 

11.031 

.376 

61.76 

.895 

2.462 

24 

17.186 

16.97 

23.644 

58.707 

4.6216 

.404 

54.57 

.859 

2.0504 

25 

17.186 

16.98 

23.644 

59.000 

4.6216 

.504 

54.57 

.862 

1.9203 





Tablt! A.G: Results obtained by the Steepest Descent Method 
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Table A. 10: Results obtained for the multi-ohjcctivc Function by GA (for dimension of cast billet 150 sq 
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Table A. 12: Results obtained for the multi-objective function by DE (for dimension of cast billet 125 sq.mm.) 
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Table A.14: Results obtained for the multi-objective function by SA (for dimension of cast billet 100 sq.mm.) 
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Table A.15: Results obtained for the multi-objective function by SA (for dimension of cast billet 125 sq.mm . ) 
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Table A. 16: Results obtained for the multi-objective function by SA (for dimension of cast billet 150 sq.mm.) 
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Table A. 17: Results ohtainpd for the miilti-ohjcctive function (for dimension of cast billet 100 sq.mm.) 
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Table A. 18: Results obtained for the multi-objective function (for dimension of cast billet 125 sq.mm.) 
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Table A. 19; Results obtained for the multi-objective function (for dimension of cast billet 150 sq.mm.) 
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